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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT : Jung, Rudolf 

Beach, Larry R. 
Dress, Virginia M. 
Rao, A. Gururaj 
Ranch, Jerome P. 
Ertl, David S. 
Higgins, Regina K. 

(ii) TITLE OF THE INVENTION: Alteration of Amino Acid^mpositions 

in Seeds 

(iii) NUMBER OF SEQUENCES : 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pioneer Hi-Bred International, I. 

(B) STREET: 7100 NW 62nd Avenue, P.O. Box 1000 

(C) CITY: Johnston 

(D) STATE: I A 

(E) COUNTRY: USA 

(F) ZIP: 50131 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(Viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Michel, Marianne H 

(B) REGISTRATION NUMBER: 3 5,286 

(C) REFERENCE /DOCKET NUMBER: 0815 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 515-334-4467 

(B) TELEFAX: 515-334-6883 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3363 base pairs 

(B) TYPE: nucleic acid 

-41- 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



TCCACCTCGA GGGGGGGCCC GGTACCCAGC TTTTGTTCCC TTTAGTGAGG GTTAATTGCG 
CGCTTGGCGT AATCATGGTC ATAGCTGTTT CCTGTGTGAA ATTGTTATCC GCTCACAATT 
CCAcIc^cI TACGAGCCGG AAGCATAAAG TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC 
rZcTC^lr TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC 
ca^SgcaS aa^gaatcgg CCAACGCGCG GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT 
^CCGCT™ CGCTCACTGA CTCGCTGCGC TCGGTCGTTC GGCTGCGGCG AGCGGTATCA 
gctcactcaI AGGCGGTAAT ACGGTTATCC ACAGAATCAG GGGATAACGC AGGAAAGAAC 
"ZZ AAGGC C AGC A AAAGGCCAGG AACCGTAAAA AGG C CG CGTT GCTGGCGTTT 
TTCCATAGGC TCCGCCCCCC TGACGAGCAT C AC AAAAAT C GACGCTCAAG TCAGAGGTGG 
CGA^CCCGA CAGGACTATA AAG AT AC C AG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC 
TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC 
gtcgcgctS ctcatagctc ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC 
aStIggS CTCTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC 
TATCGTCTTG AOTCcScCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT 
ScaggaSa gcIgag^gag GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT 
AACTACGGCT ACaSaGAAG GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC 

t^cgSaaaa! gIgt^gtag CTCTTGATCC GGCAAACAAA ccaccgctgg tagcggtggt 
Stttt^ gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 
™ cSggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 
^gIgI™ caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa 
ZZtcIiZ gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag 
gcacctatct cIgcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg 
?Sa™tI cSatacggga gggcttacca tctggcccca gtgctgcaat oataccgcga 
gaIccacgct caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag 
ScagaIgS Scctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa 
gSag^aI gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc 
IZcIgZ^ cSgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca 
aggcgag™ catgScccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg 

ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT 
SSSSS CTC?CA^GCC AT C CGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC 
iiSSSS gIgaatagtg TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG 
^ZfZZ cScctcATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG 
tZ^lc TCTCA^GGAT CTTACCGCTG TTGAGATCCA gttcgatgta acccactcgt 
gcaccca^? ga?c?tcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca 

GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA 
c1cttcc??t~ t~tcaatatta TTGAAGCATT TATCAGGGTT ATTGTCTCAT gagcggatac 
Sa^gaI? g^tagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa 
gtcccIcctI Sttgtaagc gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa 
tcagScaS t^taaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat 
aSccgaSS Igggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg 
Tof^ccZ cg?caIISgg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac 
C^CACCCTt atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta 
AACGGAGCCC ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag 
ggaagaaagc gaa^ggIgcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg 

TAA^clc AC^CGCCGCG CTTAATGCGC CGCTACAGGG CGCGTCCCAT TCGCCATTCA 
GG^TGCGCA^ CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA CGCCAGCTGG 
CGAAAGGGGG SgTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT TCCCAGTCAC 
GAC^?G?S AACGACGGCC AGTGAGCGCG CGTAATACGA CTCACTATAG GGCGAATTGG 
SScCACC^ CGGTGGCGGC CGCTCTAGAA CTAGTGGATC CGTCGACTAG AGGGCCCGAC 
cIZZctTa GGCACTAAGG GATGTGAGGC CAGCATCACC GTTGCAGAAA TTGACACAAG 
StcSIca atStccaaa TAGAGTTTCA TTTCTTCGTC GTCAGCAGCT GCGTTGACCA 



60 
120 
180 
240 
300 
3S0 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 



TGTAGTCACA 
CTGAGTTGGA 
GTTTACACCT 
AGCAGTTTCT 
GAACCAACCC 
TGG 



CATGGAAGCC CTACACCCCA AGCCTGTAGG Zl^O SS 
CACAAGGGCC AATTTGGGGA AGCCTGTAGG ^ATTTTC CAAAGGTTGT 
ACAGACGCCT GCGCATAACT jTCTOMCMC £££££ ACCTGTTCGA 
CAGTATAAGT ESS GAGGCCCTTG CTGGTGGCCA 



3120 
3180 
3240 
3300 
33S0 
3363 



2) INFORMATION FOR SEQ ID NO: 



L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

ii) MOLECULE TYPE: Other 



i) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



TCGACCTCGA 
CGCTTGGCGT 
CCACACAACA 
TAACTCACAT 
CAGCTGCATT 
TCCGCTTCCT 
GCTCACTCAA 
ATGTGAGCAA 
TTCCATAGGC 
CGAAACCCGA 
TCTCCTGTTC 
GTGGCGCTTT 
AAGCTGGGCT 
TATCGTCTTG 
AACAGGATTA 
AACTACGGCT 
TTCGGAAAAA 
TTTTTTGTTT 
ATCTTTTCTA 
ATGAGATTAT 
TCAATCTAAA 
GCACCTATCT 
TAGATAACTA 
GACCCACGCT 
CGCAGAAGTG 
GCTAGAGTAA 
ATCGTGGTGT 
AGGCGAGTTA 
ATCGTTGTCA 
AATTCTCTTA 
AAGTCATTCT 
GATAATACCG 
GGGCGAAAAC 
GCACCCAACT 
GGAAGGCAAA 
CTCTTCCTTT 
ATATTTGAAT 
GTGCCACCTA 
TCAGCTCATT 



GGGGGGGCCC 
AATCATGGTC 
TACGAGCCGG 
TAATTGCGTT 
AATGAATCGG 
CGCTCACTGA 
AGGCGGTAAT 
AAGGCCAGCA 
TCCGCCCCCC 
CAGGACTATA 
CGACCCTGCC 
CTCATAGCTC 
GTGTGCACGA 
AGTCCAACCC 
GCAGAGCGAG 
AC AC T AG AAG 
GAGTTGGTAG 
GCAAGCAGCA 
CGGGGTCTGA 
CAAAAAGGAT 
GTATATATGA 
CAGCGATCTG 
CGATACGGGA 
CACCGGCTCC 
GTCCTGCAAC 
GTAGTTCGCC 
CACGCTCGTC 
CATGATCCCC 
GAAGTAAGTT 
CTGTCATGCC 
GAGAATAGTG 
CGCCACATAG 
TCTCAAGGAT 
GATCTTCAGC 
ATGC CGC AAA 
TTCAATATTA 
GTATTTAGAA 
AATTGTAAGC 
TTTTAACCAA 



GGTACCCAGC 
ATAGCTGTTT 
AAGCATAAAG 
GCGCTCACTG 
CCAACGCGCG 
CTCGCTGCGC 
ACGGTTATCC 
AAAGGCCAGG 
TGACGAGCAT 
AAG AT AC C AG 
GCTTACCGGA 
ACG CTGTAGG 
ACCCCCCGTT 
GGTAAGACAC 
GTATGTAGGC 
GACAGTATTT 
CTCTTGATCC 
GATTACGCGC 
CGCTCAGTGG 
CTTCACCTAG 
GTAAACTTGG 
TCTATTTCGT 
GGGCTTACCA 
AGATTTATCA 
TTTATCCGCC 
AGTTAATAGT 
GTTTGGTATG 
CATGTTGTGC 
GGCCGCAGTG 
ATCCGTAAGA 
TATGCGGCGA 
CAGAACTTTA 
CTTACCGCTG 
ATCTTTTACT 
AAAGGGAATA 
TTGAAGCATT 
AAATAAACAA 
GTTAATATTT 
TAGGCCGAAA 



TTTTGTTCCC 
CCTGTGTGAA 
TGTAAAGCCT 
CCCGCTTTCC 
GGGAGAGGCG 
TCGGTCGTTC 
ACAGAATCAG 
AAC CGT AAAA 
C AC AAAAAT C 
GCGTTTCCCC 
TACCTGTCCG 
TATCTCAGTT 
CAGCCCGACC 
GACTTATCGC 
GGTGCTACAG 
GGTATCTGCG 
GG C AAAC AAA 
AGAAAAAAAG 
AACGAAAACT 
ATCCTTTTAA 
TCTGACAGTT 
TCATCCATAG 
TCTGGCCCCA 
GCAATAAACC 
TCCATCCAGT 
TTGCGCAACG 
GCTTCATTCA 
AAAAAAGCGG 
T TAT C AC T C A 
TGCTTTTCTG 
CCGAGTTGCT 
AAAGTGCTCA 
TTGAGATCCA 
TTCACCAGCG 
AGGGCGACAC 
TATCAGGGTT 
ATAGGGGTTC 
TGTTAAAATT 
TCGGCAAAAT 



TTTAGTGAGG 
ATTGTTATCC 
GGGGTGCCTA 
AGTCGGGAAA 
GTTTGCGTAT 
GGCTGCGGCG 
GGGATAACGC 
AGGCCGCGTT 
GACGCTCAAG 
CTGGAAGCTC 
CCTTTCTCCC 
CGGTGTAGGT 
GCTGCGCCTT 
CACTGGCAGC 
AGTTCTTGAA 
CTCTGCTGAA 
CCACCGCTGG 
GATCTCAAGA 
CACGTTAAGG 
ATTAAAAATG 
ACCAATGCTT 
TTGCCTGACT 
GTGCTGCAAT 
AGCCAGCCGG 
CTATTAATTG 
TTGTTGC CAT 
GCTCCGGTTC 
TTAGCTCCTT 
TGGTTATGGC 
TGACTGGTGA 
CTTGCCCGGC 
TCATTGGAAA 
GTTCGATGTA 
TTTCTGGGTG 
GGAAATGTTG 
ATTGTCTCAT 
CGCGCACATT 
CGCGTTAAAT 
CCCTTATAAA 



GTTAATTGCG 
GCTCACAATT 
ATGAGTGAGC 
CCTGTCGTGC 
TGGGCGCTCT 
AGCGGTATCA 
AGGAAAGAAC 
GCTGGCGTTT 
TCAGAGGTGG 
CCTCGTGCGC 
TTCGGGAAGC 
CGTTCGCTCC 
ATCCGGTAAC 
AGCCACTGGT 
GTGGTGGCCT 
GCCAGTTACC 
TAGCGGTGGT 
AGATCCTTTG 
GATTTTGGTC 
AAGTTTTAAA 
AATCAGTGAG 
CCCCGTCGTG 
GATACCGCGA 
AAGGGCCGAG 
TTGCCGGGAA 
TGCTACAGGC 
CCAACGATCA 
CGGTCCTCCG 
AGCACTGCAT 
GTACTCAACC 
GTCAATACGG 
ACGTTCTTCG 
ACCCACTCGT 
AGCAAAAACA 
AAT AC T CAT A 
GAG C GG AT AC 
TCCCCGAAAA 
TTTTGTTAAA 
TCAAAAGAAT 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



AG AC CG AG AT AGGGTTGAGT GTTOTTCCAa TTTGGAAC- GAG^CCAC^ TTAAAG^ 
TGGACTCCAA CGTCAAAGGG CGAAAAACGG ^TATC AGCACTAAAT CGGAACCCTA 

CATCACCCTA ATCAAGTTTT TTGGGGTCGA ^ GAACGTGGCG AGAAAGGAAG 

AAGGGAGCCC CCGATTTAGA GCTTGACGGG TGTAGCGGTC ACGCTGCGCG 

GGAAGAAAGC G AAAGG AG CG GGCGCTAGGG ^CTG CGCGTCCCAT TCGCCATTCA 

TAACCACCAC ACCCGCCGCG CTTAATGCGC JGCTACAG TTCGCTATTA CGCCAGCTGG 
GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCC^ TCCCAGTCAC 
CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGG™C CTCACTATAG GGCGAATTGG 2880 
GACGTTGTAA AACGACGGCC AGTGAGCGCG ^TAATAC CGTCGACTAG AGGGCCCGAC 2940 
AGCTCCACCG CGGTGGCGGC CGCTCTAGAA ™TGGATC ^ TTGACACAAG 3000 

GTCGAACTTA GGCACTAAGG GATGTGAGGC CAGCATCAC^ ^ GCGTTGACCA 3060 

CATCACCACA ATTTTCCAAA TAGAGTTTCA ^TTCli qtTGACGGTG TCTGGTTCAT 3120 

TGTAGTCACA CATGGAAGCC CTACACCCCA ^C^A GCATTTTCCG CTACTAGTCA 3180 

ssss sees esse ™c= ™ = c = lz 

cS C T = SStT ^ "C, ,3,0 



TGGTG 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CTAAATTGTA AGCGTTAATA TTTT™ A^TCGCG^ ^TTTTGTT = CTC 
ATTTTTTAAC CAATAGGCCG AAATCGGCAA ^CCCTTA A ACGTGGACT C 

GATAGGGTTG AGTGTTGTTC -OTT-GAA C^GTCCA ^TCACC 
CAACGTCAAA GGGCGAAAAA CCGTCTATCA ^ ^tcGGAACC CTAAAGGGAG 

CTAATCAAGT TTTTTGGGGT CGAGGTGCCG JAAAGCA QCGAGAAAGG AAGGGAAGAA 

CCCCCGATTT AGAGCTTGAC GGGGAAAGCC ^GAACG gtcacgctgc GCGTAACCAC 
AGCGAAAGGA GCGGGCGCTA ^GCGCTGGC ^CGCGTCC CATTCGCCAT TCAGGCTGCG 
CACACCCGCC GCGCTTAATG CGCCGCTACA ^ TTACGCCAGC TGGCGAAAGG 

CAACTGTTGG G AAGGG CG AT ^GTGCGGGC ^T TTTTCCCAGT CACGACGTTG 

GGGATGTGCT GCAAGGCGAT TAAGTTGGGT T AGGGCGAAT TGGAGCTCCA 

TAAAACGACG GCCAGTGAGC GCGCGTAATA ^ACTCACTA Q GCCCTAAAGC 

CCGCGGTGGC GGCCGCTCTA <^™TATAA ™TAAGCT ^ TCGTCGCACG 

AC T AT C GT AT CACCTATCTA AATAAGTCAC GGGAGTT c AC AC AT AA AGGTTTATGT 

GAATTGCATG TTTCTTGTTG ^GCATATT ^GCAA GGATGCATAT GTATGGTTCT 
ATAAACTTAC ATTTAGCTCA GTTTAATTAC £££££ ATCTTAATTC ACTCCAACAT 
CAATCCATAT AAGTTAGAGT ^^^AG ^^CTAC TTATATTGAG GTGAATTTGG 
ATATGGATCT ACAATACTCA TGTGCATCCA ^AAACT ^ AGGA GTTTCA 

TAGAAATTAA ACTAACTTAC ^ACTAAGCC TTCTTTATAA TCAACCCGCA 

ACGATCGTC C CGCGTCAATA ™TAAAAA JCTCCTACAT TAAGGTGAAA 
CTCTTATAAT CTCTTCTCTA CTACTATAAT ^A ATTCACACAA CCTAATCAAT 

TTATCTATAA GTGTTCTGGA TATTGGTTGT TGGCTCCCA ^ TATATATCAT 

AGAAAACATA TGTTTTATTA AAACAAAATT JATCATM QTGCAACTTA ATACATGTGT 
ATATATATAT AAACCGTAGC AATGCACGGG C ™AACTA ATGGATCGAA 

Sattaagat gaataagagg gtatccaaat aaaaaacttg ™ct tagaagqaga 

AGGGGTTGGA AACGATTAAA CGATTAAATC ^TTCCTAG ^ CTAAAGTGGT 

GAGGAACACG SSSSS ££££ ^GCTCC AAGAATTTGT TGTATCCTTA 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



ACAACTCACA 
ACAAATCCTC 
TATACATGCT 
TTCATGAAAA 
CAACAAAAAA 
TGTAGAAGGC 
ACGTGCTACG 
TACAGCCGTC 
TAACGCTCGC 
TTATCTACTC 
AGGGTGTGAT 
GCAAGAGTTG 
GCGCCAAGAA 
CGAAAGGCTT 
ATTGCAACTT 
ACGAAGAAAT 
GTGATGCTGG 
ATCCCCGGCG 
GCTAGCTAGC 
CCATGCATGG 
GAAAAAAGTA 
ATATATGTTT 
ATCTAAGAAC 
GATGAGATCC 
GTGTATTCAC 
CAACGGAAGC 
AGGGTTAATT 
TCCGCTCACA 
CTAATGAGTG 
AAACCTGTCG 
TATTGGGCGC 
GCGAGCGGTA 
CGCAGGAAAG 
GTTGCTGGCG 
AAGTCAGAGG 
CTCCCTCGTG 
CCCTTCGGGA 
GGTCGTTCGC 
CTTATCCGGT 
AGCAGCCACT 
GAAGTGGTGG 
GAAGCCAGTT 
TGGTAGCGGT 
AGAAGATCCT 
AGGGATTTTG 
ATGAAGTTTT 
CTTAATCAGT 
ACTCCCCGTC 
AATGATACCG 
CGGAAGGGCC 
TTGTTGCCGG 
CATTGCTACA 
TTCCCAACGA 
CTTCGGTCCT 
GGCAGCACTG 
TGAGTACTCA 
GGCGTCAATA 
AAAACGTTCT 



GAACATCAAC 
TCTGTGTGCA 
TACAGCTCAC 
ATAAAATAGG 
AAAGCCATAT 
AACAAAACTG 
TAAAGAGAGT 
TCGGTGGCAT 
ATGCCTGTGC 
CAGAGCGCAG 
GGTGTGTTTA 
CTGCAAGAGT 
GCTTTGCGCA 
CCCCAAATTG 
GGGGTGTAGG 
GAAACTCTAT 
CCTCACATCC 
GTGTCCCCCA 
TAGTTGAGTC 
GTGGCAGTGT 
TTGTTCCAAA 
TCTGTATATG 
TAAAACAAAT 
CTCGTAATAT 
ATTTGTTGCG 
GAGTCGACCT 
GCGCGCTTGG 
ATTCCACACA 
AGCTAACTCA 
TGCCAGCTGC 
TCTTCCGCTT 
TCAGCTCACT 
AACATGTGAG 
TTTTTCCATA 
TGGCGAAACC 
CGCTCTCCTG 
AGCGTGGCGC 
TCCAAGCTGG 
AACTATCGTC 
GGTAACAGGA 
CCTAACTACG 
ACCTTCGGAA 
GGTTTTTTTG 
TTGATCTTTT 
GTCATGAGAT 
AAATCAATCT 
GAGGCACCTA 
GT GT AG AT AA 
CGAGACCCAC 
GAGCGCAGAA 
GAAGCTAGAG 
GGCATCGTGG 
TCAAGGCGAG 
CCGATCGTTG 
CATAATTCTC 
ACCAAGTCAT 
CGGGATAATA 
TCGGGGCGAA 



CAAAATTGCA 
AAGAAACACG 
AAGACATTAC 
CCGGACAGGA 
GTCAAGCTAA 
AGCCACGCAG 
GACGAGTCAT 
AAGAACACAA 
ACTTCTCCAT 
AAGAACCCGA 
CTTATACTGG 
ACCCTGGGAA 
GGCGTCTGCA 
GCCCTTGTGT 
GCTTCCATGT 
TTGGAAAATT 
CTTAGTGCCT 
CTGAAGAAAC 
ATTTAGCGGC 
CAGTGTGAGC 
TTAAACGTTT 
TCTAATTTGT 
GCTAATTTGA 
CACCGACATC 
CGTAGGCGTA 
CGAGGGGGGG 
CGTAATCATG 
ACATACGAGC 
CATTAATTGC 
ATTAATGAAT 
CCTCGCTCAC 
CAAAGGCGGT 
C AAAAGG CCA 
GGCTCCGCCC 
CGACAGGACT 
TTCCGACCCT 
TTTCTCATAG 
GCTGTGTGCA 
TTGAGTCCAA 
TTAGCAGAGC 
GCTACACTAG 
AAAGAGTTGG 
TTTGCAAGCA 
CTACGGGGTC 
TATCAAAAAG 
AAAGTATATA 
TCTCAGCGAT 
CTACGATACG 
GCTCACCGGC 
GTGGTCCTGC 
TAAGTAGTTC 
TGTCACGCTC 
TTACATGATC 
TCAGAAGTAA 
TTACTGTCAT 
TCTGAGAATA 
CCGCGCCACA 
AACTCTCAAG 



CGTCAAGGGT 
GTGAGTCATG 
AAACAACTCA 
CAAAAATCCT 
ATCTAATTCG 
AAGTACAGAA 
ATACATTTGG 
GAAATTGTGT 
CACCACCACT 
TCGACACCAT 
GGTTGGTTCT 
GGAAGTGCTA 
AGTGTAAGCT 
CCAACTCAGA 
GTGACTACAT 
GTGGTGATGC 
AAGTTCGACG 
TATGTGCTGT 
GATGATTGAG 
AATGACCTGA 
TAACCTTTTA 
TATCATCCAT 
AATGAAGGGA 
ACACGTGTCC 
CCCAACAATT 
CCCGGTACCC 
GTCATAGCTG 
CGGAAGCATA 
GTTGCGCTCA 
CGGCCAACGC 
TGACTCGCTG 
AATACGGTTA 
GCAAAAGGCC 
CCCTGACGAG 
ATAAAGATAC 
GCCGCTTACC 
CTCACGCTGT 
CGAACCCCCC 
CCCGGTAAGA 
GAGGTATGTA 
AAGGACAGTA 
TAGCTCTTGA 
GCAGATTACG 
TGACGCTCAG 
GATCTTCACC 
TGAGTAAACT 
CTGTCTATTT 
GGAGGGCTTA 
TCCAGATTTA 
AACTTTATCC 
GCCAGTTAAT 
GTCGTTTGGT 
CCCCATGTTG 
GTTGGCCGCA 
GCCATCCGTA 
GTGTATGCGG 
TAGCAGAACT 
GATCTTACCG 



ATTGGGTAAG 
CCGAGATCAT 
TATTGCATTA 
TGACGTGTAA 
TTTTACGTAG 
TGATTCCAGA 
CAAGAAACCA 
TAATTAATCA 
GGGTCTTCAG 
GGCCACCAGC 
CGAACAGGTG 
CAACCTTTGC 
GACTAGTAGC 
TGAACCAGAC 
GGTCAACGCA 
TTGTGTCAAT 
TCGGGCCCTC 
AGTATAGCCG 
TAATAATGTG 
ATGAACAATT 
ATAGGTTTAT 
TTAGATATAG 
GTATATATTG 
AGTTAATGTA 
TTGATCGACT 
AGCTTTTGTT 
TTTCCTGTGT 
AAGTGTAAAG 
CTGCCCGCTT 
GCGGGGAGAG 
CGCTCGGTCG 
TCCACAGAAT 
AGGAACCGTA 
CATCACAAAA 
CAGGCGTTTC 
GGATACCTGT 
AGGTATCTCA 
GTTCAGCCCG 
CACGACTTAT 
GGCGGTGCTA 
TTTGGTATCT 
TCCGGCAAAC 
CGCAGAAAAA 
TGGAACGAAA 
TAGATCCTTT 
TGGTCTGACA 
CGTTCATCCA 
CCATCTGGCC 
TCAGCAATAA 
GCCTCCATCC 
AGTTTGCGCA 
ATGGCTTCAT 
TGCAAAAAAG 
GTGTTATCAC 
AGATGCTTTT 
CGACCGAGTT 
TTAAAAGTGC 
CTGTTGAGAT 



AAACAATCAA 
ACTCATCTGA 
CAAAGATCGT 
AGTAAATTTA 
ATCAACAACC 
TGAACCATCG 
TGAAGCTGCC 
AAGCTATAAA 
ACCATTAGCT 
AAGGGCCTCA 
CAAGTAGAAG 
AAAGTCAAAG 
GGAAAATGCC 
ACCGTCAAGT 
GCTGCTGACG 
TTCTGCAACG 
TAGTCGACGG 
CTGCCCGCTG 
TCACGCATCA 
GAAATGAAAA 
ACAATAATTG 
ACAAAAAAAA 
GGATAATGTC 
TCAGTGATAC 
ATCAGAAAGT 
CCCTTTAGTG 
GAAATTGTTA 
CCTGGGGTGC 
TCCAGTCGGG 
GCGGTTTGCG 
TTCGGCTGCG 
CAGGGGATAA 
AAAAGGCCGC 
ATCGACGCTC 
CCCCTGGAAG 
CCGCCTTTCT 
GTTCGGTGTA 
ACCGCTGCGC 
CGCCACTGGC 
CAGAGTTCTT 
GCGCTCTGCT 
AAACCACCGC 
AAGGATCTCA 
ACTCACGTTA 
TAAATTAAAA 
GTTACCAATG 
TAGTTGCCTG 
CCAGTGCTGC 
ACCAGCCAGC 
AGTCTATTAA 
ACGTTGTTGC 
TCAGCTCCGG 
CGGTTAGCTC 
TCATGGTTAT 
CTGTGACTGG 
GCTCTTGCCC 
TCATCATTGG 
CCAGTTCGAT 



1680 
1740 
1800 
1860 
1920 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 



-45- 



GTAACCCACT CGTGCACCGA = -C = TTTT = CA 0C = 
GTGAGCAAAA ACAGGAAGGC ^TGCCGC TTATTGAAGC ATTTATCAGG GTTATTGTCT 
CATGAGCGGA S-G HESES GAAAAATAAA C_G TT CCGCGCAC 

ATTTCCCCGA AAAGTGCCAC 



5160 
5220 
5280 
5340 
5360 



(2) 



INFORMATION FOR SEQ ID NO : 4 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5511 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



TCGCGCGTTT 
CAGCTTGTCT 
TTGGCGGGTG 
ACCATATGCG 
ATTCGCCATT 
TACGCCAGCT 
TTTCCCAGTC 
GCATATCTGT 
CGATGCTGAG 
T C GG C TAT AG 
TTGGGATACT 
TTTTAACAGC 
GTAACTTTTC 
AAATAAAGAA 
CAAAATTATC 
CGAGATCATT 
ATGTATCGTC 
GGGGTGCTGT 
TTCGTCTGCT 
GACAAATGAA 
TGCCTCAGTA 
ACTGCCTGCC 
GTCTCCACTC 
CCTGCTCGTT 
GCCAGCATCA 
CATTTCTTCG 
CAAGTTGCAA 
GAAGCCTTTC 
CTTCTTGGCG 
GCAACTCTTG 
CATCACACCC 
GGTGTGTGTT 
TATACCAGCG 
GTGGCAGCAG 
GCTCCGGCTC 
AGTGGTGTCG 
CCTACTAGGT 
GGGCAGTTTT 
AGCTATTAGT 
TGGTCTATTA 



CGGTGATGAC 
GTAAGCGGAT 
TCGGGGCTGG 
GTGTGAAATA 
CAGGCTGCGC 
GGCGAAAGGG 
ACGACGTTGT 
GCATTACTAC 
AACGAACGGG 
CAGGTGAAAG 
TAAATTTGGA 
TCTAGGCAGG 
TCTCTGTTTC 
AAACGGAGGC 
TTCTAATTTT 
GCTTGGGATG 
TGAAACATGT 
TTGAGCCTAG 
GTGTCTGTTC 
GAAGCGAAGA 
GCTTACAGCA 
GCTTACGTAC 
AGGCTCATGC 
TTGGCGAGCT 
CCGTTGCAGA 
TCGTCAGCAG 
TACTTGACGG 
GGGCATTTTC 
CCTTTGACTT 
CCTTCTACTT 
TTGAGGCC CT 
GGATCCTTGG 
CCGGGCCCTG 
ACACACGGGT 
GGTGCACGCC 
TGTTCCGGGG 
ACGTTCATTG 
TTTTTAGAAA 
TGGTTTTAGC 
GTTGACTCAT 



GGTGAAAACC 
GCCGGGAGCA 
CTTAACTATG 
CCGCACAGAT 
AACTGTTGGG 
GGATGTGCTG 
AAAACGACGG 
CTGGGATACA 
GAAGAAAGTA 
TTCGTGCGCC 
GAGTTTGTTG 
ATATAAGATT 
GTTTATAGCC 
AGTAAAAAAT 
AAAAGCTACA 
GGCAGGGCCA 
AGGGGCTAAT 
CGATGAAGGG 
TCAGCGTAAC 
GATTATAGAA 
TCGTACCCGC 
ATAGTTAACA 
TACGTACGCA 
AGAGGGCCCG 
AATTGACACA 
CTGCGTTGAC 
TGTCTGGTTC 
CGCTACTAGT 
TGCAAAGGTT 
GCACCTGTTC 
TGC TGGTGGC 
GTTGGCTGTA 
GTACGTGGCG 
CGCCACGTCC 
AGGGTGTGGC 
ACTCCGATCC 
TATCTGGACG 
AACAGAGAAT 
TTAGATTAAA 
TTTAAGGCCC 



TCTGACACAT 
GACAAGCCCG 
CGGCATCAGA 
GCGTAAGGAG 
AAGGGCGATC 
CAAGGCGATT 
CCAGTGAATT 
AGGGCTTCTC 
AGCGCCGCCC 
AATGAAAAGG 
CATACACTAA 
TATATCTAAT 
GATTAGCACA 
ACCCAAAAAA 
TATTAAAAAT 
ATAGCTAATT 
AGTTAGATGA 
TCATAGTTTC 
GGCATCAATG 
CACGCACGTG 
ACGTACATAC 
CGCAGAGAGG 
CGTCGGTCGC 
ACGTCGAACT 
AG CAT C AC C A 
CATGTAGTCA 
ATCTGAGTTG 
CAGCTTACAC 
GTAGCACTTC 
GAGAACCAAC 
CATGGTGTAG 
TGCAGAACTA 
CGGCCGCGCG 
CGCCGTACTC 
CGCCTCTGAG 
GCGGCGAGCG 
ACGGGCAGCG 
TCCGTTGAGC 
GAAGCTAACG 
TGTTTCAATC 



GCAGCTCCCG 
TCAGGGCGCG 
GCAGATTGTA 
AAAATACCGC 
GGTGCGGGCC 
AAGTTGGGTA 
CTTTTATGAA 
CGCCATAACA 
AAAAAAAACG 
GAACGATATG 
TCCACTAAAG 
CTGTTGGAGT 
AAATTAAACT 
ATACTTGGAG 
ACTATATATT 
GCTAAGGATG 
CTAATTTGCT 
ATACAAGAAC 
GATGCCAAAC 
TCATTATTTA 
TACAGAGCCA 
TATATACATA 
GCGCCACCCT 
TAGGCACTAA 
CAATTTTCCA 
CACATGGAAG 
GACACAAGGG 
TTGCAGACGC 
CTTCCCAGGG 
CCCAGTATAA 
TGTCGACTGT 
AAGCGGAGGT 
GCTACGTGGA 
TCCTTACCGT 
CAGACTTTGT 
AC CG AGCGTG 
G AC AATTTG C 
TAATTGTAAT 
ACTAATAGCT 
TCGCGAGATA 



GAGACGGTCA 
TCAGCGGGTG 
CTGAGAGTGC 
ATCAGGCGCC 
TCTTCGCTAT 
ACGCCAGGGT 
TAATAATAAT 
AATTGAGTTG 
AACATGTACG 
CGTTGGGTAG 
TTGTCTATCT 
TG CTTTT AG A 
AGGTGACGAG 
ATTTTTGTCT 
AAAAATACTT 
GGCTATATTT 
GTGTTCGTAC 
TCACTTTTGG 
TCCGCAAGGG 
TTTATGGACT 
CACTTATTGC 
CACGTCCAAC 
CTCGTTGCTT 
GGGATGTGAG 
AATAGAGTTT 
CCCTACACCC 
CCAATTTGGG 
CTGCGCAAAG 
TACTCTTGCA 
GTAAACACAC 
GATATCCTCG 
GGCGCGCATT 
GGAAGGCTGC 
GCTTATCCGG 
CGTGTTCCAC 
TAAAAGAGTT 
TGTAAGAGAG 
TCAACAAATA 
AATAATTAGT 
AACTTTAGCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



-46- 



GCTATTTTTT 
TGAAACTAAA 
ATCCCGTGAG 
ACTACTGGAA 
TGCCTAGTGT 
TGTCGAGTAC 
TCTATGATTT 
AAACATTATC 
TCATTAATTT 
CACGGTCATG 
TAACTTGTGC 
AGTGACTTGC 
ATATTTATAG 
ACATAGTGTA 
CTTTATCGAG 
TTCCTGTGTG 
AGTGTAAAGC 
TGCCCGCTTT 
CGGGGAGAGG 
GCTCGGTCGT 
CCACAGAATC 
GGAACCGTAA 
ATCACAAAAA 
AGGCGTTTCC 
GATACCTGTC 
GGTATCTCAG 
TTCAGCCCGA 
ACGACTTATC 
GCGGTGCTAC 
TTGGTATCTG 
CCGGCAAACA 
GCAGAAAAAA 
GGAACGAAAA 
AGATCCTTTT 
GGTCTGACAG 
GTTCATCCAT 
CATCTGGCCC 
C AG C AAT AAA 
CCTCCATCCA 
GTTTGCGCAA 
TGGCTTCATT 
GCAAAAAAGC 
TGTTATCACT 
GATGCTTTTC 
GACCGAGTTG 
TAAAAGTGCT 
TGTTGAGATC 
CTTTCACCAG 
TAAGGGCGAC 
TTTATCAGGG 
AAATAGGGGT 
TTATCATGAC 



AGCTACTTTT 
CTTTAGCACT 
ATTGAAACGG 
TCGAGCGCTT 
GACACTCGAC 
TTTTTATCGG 
TATGTGTAGG 
AAATTTTTAT 
CTGACTTTAT 
TTTAGTGAGC 
TAGATAACAT 
AGTTTAAATG 
AAAACATTTT 
GGAACATACC 
TGTCCAAGGA 
AAATTGTTAT 
CTGGGGTGCC 
CCAGTCGGGA 
CGGTTTGCGT 
TCGGCTGCGG 
AGGGGATAAC 
AAAGGCCGCG 
TCGACGCTCA 
CCCTGGAAGC 
CGCCTTTCTC 
TTCGGTGTAG 
CCGCTGCGCC 
GCCACTGGCA 
AGAGTTCTTG 
CGCTCTGCTG 
AACCACCGCT 
AGGATCTCAA 
CTCACGTTAA 
AAATTAAAAA 
TTACCAATGC 
AGTTGCCTGA 
CAGTGCTGCA 
CCAGCCAGCC 
GTCTATTAAT 
CGTTGTTGCC 
CAGCTCCGGT 
GGTTAGCTCC 
CATGGTTATG 
TGTGACTGGT 
CTCTTGCCCG 
CATCATTGGA 
CAGTTCGATG 
CGTTTCTGGG 
ACGGAAATGT 
TTATTGTCTC 
TCCGCGCACA 
ATTAACCTAT 



AGCCATTTGT 
TCAATTCATA 
GGCCTAAATC 
TGCCGAGTGT 
AAAG AG AT C T 
ACACTTGACA 
TCACTTAGGT 
CACAACCTCT 
TTGTGTTTTA 
ATGGTGCTTG 
GC AT AT C ATT 
TGAATTTTCC 
TGTAAATATG 
ACAAAAAGTT 
TGGCACTCGG 
CCGCTCACAA 
TAATGAGTGA 
AACCTGTCGT 
ATTGGGCGCT 
CGAGCGGTAT 
GCAGGAAAGA 
TTGCTGGCGT 
AGTCAGAGGT 
TCCCTCGTGC 
CCTTCGGGAA 
GTCGTTCGCT 
TTATCCGGTA 
GCAGCCACTG 
AAGTGGTGGC 
AAGCCAGTTA 
GGTAGCGGTG 
GAAGATCCTT 
GGGATTTTGG 
TGAAGTTTTA 
TTAATCAGTG 
CTCCCCGTCG 
ATGATACCGC 
GGAAGGGCCG 
TGTTGCCGGG 
ATTGCTACAG 
TCCCAACGAT 
TTCGGTCCTC 
GCAGCACTGC 
GAGTACTCAA 
GCGTCAATAC 
AAACGTTCTT 
TAACCCACTC 
TGAGCAAAAA 
TG AAT ACT C A 
ATGAGC GG AT 
TTTCCCCGAA 
AAAAATAGGC 



AATCTAAACA 
TAGCTAAAGT 
TCTCAGCTAT 
CAAAGCCTGA 
CGACGAACAG 
AAGTCTTTGT 
TTCTACACAT 
ATATATGATA 
TACAATTTTT 
AAGATTCTGG 
TATTTTGCAT 
GAAGAAATTC 
TAATTGTGCC 
TGGTTGGCAA 
CAAGCTTGGC 
TTCCACACAA 
GCTAACTCAC 
GCCAGCTGCA 
CTTCCGCTTC 
CAGCTCACTC 
ACATGTGAGC 
TTTTCCATAG 
GGCGAAACCC 
GCTCTCCTGT 
GCGTGGCGCT 
CCAAGCTGGG 
ACTATCGTCT 
GTAACAGGAT 
CTAACTACGG 
CCTTCGGAAA 
GTTTTTTTGT 
TGATCTTTTC 
TCATGAGATT 
AATCAATCTA 
AGGCACCTAT 
TGTAGATAAC 
GAGACC CACG 
AGCGCAGAAG 
AAGCTAGAGT 
GCATCGTGGT 
CAAGGCGAGT 
CGATCGTTGT 
ATAATTCTCT 
CCAAGTCATT 
GGGATAATAC 
CGGGGCGAAA 
GTGCACCCAA 
CAGGAAGGCA 
TACTCTTCCT 
ACATATTTGA 
AAGTGCCACC 
GTATCACGAG 



GGAGAGCTAA 
TTAGCAGGAA 
TTTTGATGCA 
AAAACACTCC 
TACATCGACA 
CGAGTGAACT 
AGTACGTCAC 
TCATGACATG 
AAAC AAC TAG 
TCTGCTTCTG 
GCACGGTTTT 
AAATAAACGA 
AAAATGGTAC 
AATAAAAAAA 
GTAATCATGG 
CATACGAGCC 
ATTAATTGCG 
TTAATGAATC 
CTCGCTCACT 
AAAGGCGGTA 
AAAAGGCCAG 
GCTCCGCCCC 
GACAGGACTA 
TCCGACCCTG 
TTCTCAATGC 
CTGTGTGCAC 
TGAGTCCAAC 
TAGCAGAGCG 
CTACACTAGA 
AAGAGTTGGT 
TTGCAAGCAG 
TACGGGGTCT 
ATCAAAAAGG 
AAGTATATAT 
CTCAGCGATC 
TACGATACGG 
CTCACCGGCT 
TGGTCCTGCA 
AAGTAGTTCG 
GTCACGCTCG 
TACATGATCC 
CAGAAGTAAG 
TACTGTCATG 
CTGAGAATAG 
CGCGCCACAT 
ACTCTCAAGG 
CTGATCTTCA 
AAATGCCGCA 
TTTTCAATAT 
ATGTATTTAG 
TGACGTCTAA 
GCCCTTTCGT 



TGGTGGTAAT 
GCTAAACTTT 
AATTACTGTC 
GTAAAGACTT 
ACGGCTTCTT 
ACATTGAAAC 
AACTTTACCG 
TGGACAAGTT 
ATAACAAGTT 
AAATCGGTCG 
CCATGTTTCG 
ACTAAATCTA 
ATGTAGATCT 
ATAAAATATA 
TCATAGCTGT 
GGAAGCATAA 
TTGCGCTCAC 
GGCCAACGCG 
GACTCGCTGC 
ATACGGTTAT 
CAAAAGGCCA 
CCTGACGAGC 
TAAAGATACC 
CCGCTTACCG 
TCACGCTGTA 
GAACCCCCCG 
CCGGTAAGAC 
AGGTATGTAG 
AGGACAGTAT 
AGCTCTTGAT 
CAGATTACGC 
GACGCTCAGT 
ATCTTCACCT 
GAGTAAACTT 
TGTCTATTTC 
GAGGGCTTAC 
CCAGATTTAT 
ACTTTATCCG 
CCAGTTAATA 
TCGTTTGGTA 
CCCATGTTGT 
TTGGCCGCAG 
CCATCCGTAA 
TGTATGCGGC 
AGCAGAACTT 
ATCTTACCGC 
GCATCTTTTA 
AAAAAGGGAA 
TATTGAAGCA 
AAAAATAAAC 
GAAACCATTA 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5511 



(2) INFORMATION FOR SEQ ID NO : 5 : 



L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5115 base pairs 

(B) TYPE: n 



cleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

i) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



GT tgggagct ctcccatatg area™ jaacoaccac ^ ™ccc 

CCCTCGAGGT C G AC GG TAT C GATAAGCTTG ™TCTTAC TTTGGTGAAG 
TTACACAGCC CGCTCGGGCC CGCGACGTCG GGACACATCT TGTTGTCTCG 
CTCTGCTCGC AGCTGTCCGG CTCCTTGGAC ^TCGTGTGG ^ CAGGCTTAAA 
TCTCCTGTGC TTCCTGGGTA GCTTGTGTAG ^GAGCTGAC ™ CCTGTGAAGT 
ATTTGCTCGT AGACGAGGAG T AC C AGC AC A ^ACGTTGCG CGATGCGGTG 
GCAACGTCTA GGATTGTCAC ACGCCTTGGT C^GTCGCGT CGC CGTCGTACGT 
GTGAGCAGAG CAGCAACAGC TGGGCGGCCC ^GTTGGCT TCCG GGGAGGTGGT 
ACGCGCGCGC CGGGGACACG CAGCAGAGAG CGGAGAGCGA GGCAACCCAA AAGTACCCAC 
GTGGAAGTGG AGCCGCGCGC CCGGCCGCCC JCGCCCGGT GTCGCGTCGG 
GACAAGCGAA GGCGCCAAAG CGATCCAAGC TCCGGAACGC GGACGGACGC 
AGAGCCAGCC ACAAGCAGCC GAGAACCGAA ^GGTGGGCG ^GC ACGACAAGCC 
GGGCGACGCT TCCAAACGGG CCACGTACGC CGGCGTGTGC GTGCGGGTCA 
AAGGCGAGGC AGCCCCCGAT CGGGAAAGCG ^ GGGCGCGGGC GGAGGAGAGC 

GTCGCTGGTG CGCAGTGCCG GGGGGAACGG ™CG CGCGCCCCAC QTACTGCCCT 

GTGGCGAGGG CCGAGAGCAG CGCGCGGCCG ^^AC GGGGGGGC CC CGTCACATCC 

CCCCCTCCGC GCGCGCTAGA ^ACCGAGG CCTGGACCG ^ CGACAGCCGC 

ATCCATCGAC CGATCGATCG CCACAGCCAA CACCAC^ CTGCTCCGTC 
CAGGAGGAAG GAATAAACTC ACTGCCAGCC * CGACGACCAG GTTCTGTTCC 

GACCAGTGCG CGCACCGCCC ^GCAGGGCTG ^CATCTCG TQCGTGTTTG 
GATCCGATCC GATCCTGTCC TTGAGTTTCG ^CCAGATCCT TC TCTCCTAC 
ATGATCCAGG TTCTTCGAAC CTAAATCTGT ^GTGCACAC f C GTGTGTTTAC 
GCAGTGGATT AATCGCCATG GCCACCAGCA ^GGCCTCAA GG TGCAAGAGTA 
TTATACTGGG GTTGGTTCTC ^AACAGGTGC ^TA^GG CGCCAAGAAG CTTTGCGCAG 
CCCTGGGAAG GAAGTGCTAC AACCTTTGCA GAAAGGCTTC CCCAAATTGG 

GCGTCTGCAA GTGTAAGCTG ACTAGTAGCG ^MGCCC G GGTGTAGGG 
CCCTTGTGTC CAACTCAGAT ^CCAGACA CCGTCAAGTA T ^ AAACTCTATT 

CTTCCATGTG TGACTACATG J^AACGCAG CTGCTGACG gqc CTCACATCCC 

TGGAAAATTG TGGTGATGCT ^TGTCAATT ^CTGCAAC ^ G TTCGCCCTG 

TTAGTGCCTA AGTTCGACGT CGGGCCCTCT ^ATGC CATCTTCTTA TATATGCTGT 

CAGGGCCCCT GATCTCGCGC ^GGTGCAAA JATGTTGGGA CATC GTAATGTAGT 
TTCGCTTATG TGATATGGAC ^GTATGTGT ^ATGCTTGC * CGTGTGTAGT 
GTAGTGGTGG C C AGTGGC AC A^CTAATAA GCGCATGAAC TGGTGGAGTA 
TAAGTACCGA TCGGTAATTT ^ATATTGCGA ^AAATAAAT <* TAGAGTACAT 
AATAATCCCT GCTGTTCGGT GTTCTTATCG CTCCTCGTAT * AAATTTCTGC 
TTTTCTCTCT CTGAATCCTA CGTGTGTGAA ^TTCTATAT ^ GGCAGCTTCT 
GTTCCAAAAG AG AC CAT AGC CTATCTTTGG CCCTGTTTGT ? TTCTATAAAA 
GGCCACCAAA AGCTGCTGCG ^CTGCCAAA CGCTCAGATT * A GTCGTTGTA 
TTAGTTGGGG CAAAAACCAT CCAAAATCAA TATAAACACA TCTTTCTCCA 
ATATTAGGAA TCTGTCACTT TCTAGATCCT GAGCCCTATG ^AAC C AGC C AG ATT 

TACGTAATCG T AAT GAT AC T CAGATTCTCT CCACAGCCAG TGCATGGATC 
TTCAGAAAAG CTGGTCAGAA ^TTAAA CCAAACAGAC CCT^ TAGAGCTTTA 
GGCTTTCCCC GTCAAGCTCT AAATCGGGGG CTCCC1 1 1 GC CATCGCCC 
CGGCACCTCG AC CGC AAAAA ACTTGATTTG GGTGATGGTT CACGT TGGACTCTTG 
TGATAGACGG TTTTTCGCCC TTT G AC G TTG GAGTCCACGT JCTT ATAAGGGATT 
TTCCAAACTG GAACAACACT CAACCCTATC ^GGTCTATT C TAACGCGAAT 
TTGCCGATTT CGGCCTATTG GTTAAAAAAT ^GCTGATTT ^ TTACGCATCT 
TTTAACAAAA T ATT AACG TT ^AATTTCG CCTQATOOOQ * CGGAACCCCT 
GTGCGGTATT TCACACCGCA T AC AGGTGG C ACl il TCATGAGACA ATAACCCTGA 

ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ^GTATCCGC T ^ CCGTGT CGCC 

TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATbAbiA 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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CTTATTCCCT TTTTTGCGGC A = * T CC = = »~G 
A AAGTAAAAG ATGCTGAAGA TCAGTTGGGT ^ACGAG GTTTTCCAAT GATGAGCACT 

AAC AG CGGT A AGATCCTTGA CAGTTTTCGC CCCCHttGAAC ^ AGAGCAACTC 

TTTAAAGTTC TGCTATGTCA TACACTATTA ^GTATTG T CACAGAAAAG 

GGTCGCCGGG CGCGGTATTC TCAGAATGAC TTGGTTGAG CATGAGTGAT 
CATCTTACGG ATGGCATGAC AGTAAGAGAA ™GCAGTG ^CCGCTTTT 
AACACTGCGG CCAACTTACT ^TGACAACG ATCGGAGGAC CG ^ 
TTGCACAACA TGGGGGATCA TGTAACTCGC ™ATCGT aacGTTGCGC 
OCCATACCAA ACGACGAGCG ™CACCAOG ATGCCJGTAG ^ AQACTGGATG 

AAACTATTAA CTGGCGAACT ACTTACTCTA G <~ 1 TTCCGGCTGG CTGGTTTATT 

GAGGCGGATA AAGTTGCAGG JCCACTTCTG CGCTCGGCCC ^ actggggcca 

GCTGATAAAT CTGGAGCCGG ^AGCGTGGG TCTCGCG QG AGT C AGG C AACTATGGAT 

GATGGTAAGC CCTCCCGTAT CGTAGTTATC ™CACGAC TTAAGCATTG GTAACTGTCA 
GAACGAAATA GACAGATCGC ^GAGATAGGT GCCTCACT^ XTCATTTTTA ATTTAAAAGG 
GAC CAAGTTT AC T CAT AT AT ^CTTTAGATT GATTTAAAAC TGAGTTTTCG 
ATC T AGGTG A AGATCCTTTT TGATAATCTC JTGACCA^ CTTCTTGAGA TCCTTTTTTT 
TTCCACTGAG CGTCAGACCC CGTAGAAAAG JTCAAAGG^ TACCAGCGGT GGTTTGTTTG 
CTGCGCGTAA TCTGCTGCTT GCAAACAAAA GCTTCAGCAG AGCGCAGATA 

CCGGATCAAG AGCTACCAAC TCTTTTTCCG ^JGTAACTG CTCTGTAGCA 
CCAAATACTG TCCTTCTAGT GTAGCCGTAG ™GGCCACC ACT ^ TGQCGATAAG 

CCGCCTACAT ACCTCGCTCT CCTAATCCTG ™CAGT cGCA GCGGTCGGGC 

TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTAC TACAC CGAAC TGAGA 

TGAACGGGGG GTTCGTGCAC ^AGCCCAGC ™AGCG ^ GGGAGAAA GGCGGACAGG 
T AC C T AC AG C GTGAGCTATG AGAAAGCGCC A ^GCTTC GGG AGCTTCC AGGGGGAAAC 

TATCCGGTAA GCGGCAGGGT CGGAACAGGA ^GCGCACG cG TCGATTTTTG 

GCCTGGTATC TTTATAGTCC TGTCGGGTTT ^GCCACCTCT CTTTTTACGG 
TGATGCTCGT CAGGGGGGCG GAGCCTATCG JAAAACGCCA ^AAC G CCTGATTCT 
TTCCTGGCCT TTTGCTGGCC ™GCTCAC ^GTTCTTTC ^ CCGAA CGACC 
GT GG AT AAC C GTATTACCGC CTTTGAGTGA ^TGATACCG AC CGCCTCTC 
GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG ^GAGCGCC CAAT ,-. t GG AAAGCG 

CCCGCGCGTT GGCCGATTCA ^AATGCAGC ^ACGACA * ^ ccagqcttta 

GGCAGTGAGC GCAACGCAAT ™GTGAGT ^GCTCA qcggataaca ATTTCAC ACA 

GGAAACAGCT SSSS ESSE CTATTTAGGT GACACTATAG AATACTCAAG 



CTATGCATCC AACGC 

(2) INFORMATION FOR SEQ ID NO : 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5115 



(ii) MOLECULE TYPE: Other 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

CTAAATTGTA AGCGTTAATA TTTTGTTAAA — AAA TCAAAAG — CCGA 
ATTTTTTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT ™ ACGTGGACTC 
GATAGGGTTG AGTGTTGTTC CAGTTTGGAA ^AAGAGTCCA C ^CCATCACC 
CAACGTCAAA GGGCGAAAAA CCGTCTATCA °GGCGATGGC CCA £ ctaaaggqag 

CTAATCAAGT TTTTTGGGGT CGAGGTGCOG TAAAGCAC AAGGGAAGAA 
CCCCCGATTT AGAGCTTGAC GGGGAAAGCC CGCGAACGTG GCGTAACCAC 
AGCGAAAGGA GCGGGCGCTA GGGCGCTGGC AAGTGTAGCG ^ TCAGGCTGCG 
CACACCCGCC GCGCTTAATG CGCCGCTACA ^GCGCGTCC TGGCGAAAGG 
GGGATGTGCT G^CgS TAAGTTGGGT ^CCAGGG TTTTCCCAGT CACGACGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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660 
720 
780 
840 
900 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



T AAAAC G AC G GCCAGTGAGC GCG = CGACTCACTA = -™ 

™r T = ™c, jc^oj 

GAATTGCATG TTTCTTGTTG G AAG CAT AT T CACGCAATCT ™ACA TCT 9Q0 

ATAAACTTAC ATTTAGCTCA ^TTAATTAC AGTCTTATTT ^A ACTCCAACAT 960 

CAATCCATAT AAGTTAGAGT AAAAAATAAG TTTAAATTTT GTGAATTTGG 1020 

ATATGGATCT ACAATACTCA TGTGCATCCA ^AAACTAC ™TATT^ ACCAGTTTCA 
TAGAAATTAA ACTAACTTAC ^ACTAAGCC ^CTTTACT TTCTTTATAA TCAACCCGCA 
ACGATCGTCC CGCGTCAATA TTATTAAAAA ^TCCTACAT J TAAGGTGAAA 
CTCTTATAAT CTCTTCTCTA CTACTATAAT ATT ScACAA CCTAATCAAT 

TTATCTATAA GTGTTCTGGA TATTGGTTGT ^CTCCCAT * TAT AT AT CAT 

AGAAAACATA TGTTTTATTA AAACAAAATT ^TCATATAT ^ ATACATGTGT 

ATATATATAT AAACCGTAGC AATGCACGGG CATATAAC ATGGATCGAA 
GTATTAAGAT GAATAAGAGG ^TCCAAAT AAAAAACTTG TTGCTT^ TAGAAGGAGA 
AGGGGTTGGA AACGATTAAA CGATTAAATC JCTTCCTAGT C CTAAAGTGGT 
TTTAATATAT CCCAATCCCC TTCGATCATC CAGGTGCAAC TGTATCCTTA 
GAGGAACACG AAAGAACCAT OCJTTGO^T ™CTCC JAGAA^ 

ACAACTCACA GAACATCAAC CAAAATTGCA CGTCAA CCGAGATCAT ACTCATCTGA 1740 

ACAAATCCTC TCTGTGTGCA AAGAAACACG ^GAGTCATG CAAAGATCGT 1800 

TATACATGCT TACAGCTCAC AAGACATTAC AJACAACK* ™£ ^qTAAATTTA I860 

TTCATGAAAA ATAAAATAGG CCGGACAGGA CAAAAATC ATCAACAACC 1920 

CAACAAAAAA AAAG C CAT AT ^CAAGCTAA JTCTAATTCG TGAACCATCG 1980 

TGTAGAAGGC AACAAAACTG ^CCACGCAG ^TACA CAAGAAACCA TGAAGCTGCC 2040 

ACGTGCTACG TAAAGAGAGT GACGAGTCAT ^ACATl AAGCTATAAA 2100 

TACAGCCGTA TCGGTGGCAT AAGAACACAA GAAATTGTGT JAATTAA ACCATTAGCT 2160 

T AACGC T CGC ATGCCTGTGC ACTTCTCCAT CACCACCACT ^GTCT^ ^ 
TTATCTACTC CAGAGCGCAG ^GAACCCGA TCGACACCAT CAGCACCAGC 
TCATCTCTCT TCTCTTCTGC ^CGCCCACA CTTGCAGCGC C ^GCACATCA 
AAGATAGCTG OCGCAAGCM CTTAAGGGGG TGAACCTCAC 

TGGAGAAGAT CCAAGGCCGC GGCGATGA^ * GGAAGGAAAA GACGAAGACG 

TCAGGACCAT GCGGGGGAAG AATCACTACA JACGGAAGAA ^AAGGAAAA 
AAGAAGAAGA AGGACACATG ^AGAAGTGCT GCGCTTTGCA CTGGAGGAGA 
GCTCGCTCAT TTCTGTGCTG CAGAAGATAA JGGAGAACCA ^ A GGTTTGGGC 
AGGAGAAGAA G AAAAT GG AG AAGGAGCTTA JGAACTTGGC ^CTAT TGTCCCCCAC 
CCATGATCGG GTGCGACTTG TCCTCCGATG ACTAAGTTGA TCCCOGG^ 
TGAAGAAACT ATGTGCTGTA OTATAGCCGC JGGCTAGCTA ^ CTCAGTQTGA 

GCGATGATTG AGTAATAATG TGTCACGCAT ^ACCAl AATTAAACGT 
GCAATGACCT GAATGAACAA TTGAAATGAA AAGAAAAAM TTTCTGTATA TGTCTAATTT 
TTTAACCTTT TAATAGGTTT ATACAATAAT ™A™ATGT ATGCTAATTT 
GTTATCATCC ATTTAGATAT AGACGAAAAA ^CTAAGA ^TAAAA ATCACCGACA 
GAAATGAAGG GAGTATATAT TGGGATAATG JCGATGAGAT CCCTCGTAA 
TCACACGTGT CCAGTTAATG TATCAGTGAT ^TGTATTC ^A CTCGAGGGGG 
TACCCAACAA TTTTGATCGA C T ATCAGAAA ^CAACGGAA ^G GGCGTAATCA 
GGCCCGGTAC CCAGCTTTTG ^TCCCTTTAG ^AGGGTTAA ™^ CAACATACGA 
TGGTCATAGC TGTTTCCTGT ^GAAATTGT TATCCGCTCA ^ GAGA TTAATT 

GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT ^CTAATGA GCATTAATGA 
GCGTTGCGCT CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGC 
ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGGC ^TCTTC 

ACTGACTCGC TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG JATCA aGCAAAAGGC 
GTAATACGGT TATCCACAGA JTCAGGGGAT ^GCAGGAA JSAACAT taggctccgc 
CAGCAAAAGG CCAGGAACCG TAAAAAGGCC ^TGCTGG GGTGQCGAAA CCCGACAGGA 
CCCCCTGACG AGCATCACAA AAATCGACGC ^CTCA TGTTCCGACC 
C TAT AAAG AT ACCAGGCGTT JCCCCCTGGA AGCTCCCTCG TGCG ^ GCTTTCTCAT 
CTGCCGCTTA C C GG AT AC C T GTCCGCCTTT CTCCCT1 GGGCTGTGTG 
AGCTCACGCT GTAGGTATCT CAGTTCGGTG ^GTCGTTC ^TCC XCTTGAGTCC 
CACGAACCCC CCGTTCAGCC CGACCGCTGC GOT^ CTGGTAACAG GATTAGCAGA 
GCGAGGTATG TagSc TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
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AGAAGGACAG TATTTGGTAT CTC™ c^GCCAG TTACCTTCGG = GTT 
GGTAGCTCTT GATCCGGCAA ^AAACCACC GCTGGT CTTTGATCTT TTCTACGGGG 

CAGCAGATTA CGCGCAGAAA ^AG^TCT ™AA xGGTCATGAG ATTATCAAAA 

TCTGACGCTC AGTGGAACGA AAACTCACGT ™GGAT CTAAAGTATA 
AGGATCTTCA CCTAGATCCT TTTAAATTAA ^G^T GTGAGGCACC TATCTCAGCG 
TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA ^ 
ATCTGTCTAT TTCGTTCATC ^AGTTGCC CGCGAGACC C ACGCTCACCG 

CGGGAGGGCT TACCATCTGG C^AGTGCT GCAATGA CCGAGCGCAG AAGTGGTCCT 

GCTCCAGATT TATCAGCAAT ^CAGC^ AATTGTTGCC GGGAAGCTAG AGTAAGTAGT 
GCAACTTTAT CCGCCTCCAT CCAGTCTATT ^ CAGGCATCGT GGTGTCACGC 

TCGCCAGTTA ATAGTTTGCG CAACGTTGTT ^CAT GATCAAGGCG AGTTACATGA 

TCGTCGTTTG GTATGGCTTC ™AGCTCC ^TCCCAA^ TGTCAGAAG t 
TCCCCCATGT TGTGCAAAAA ^GGTTAGC ^ TGCATAATTC TCTTACTGTC 

AAGTTGGCCG CAGTGTTATC JCTCATGGTT ^GGCA CAACCAAGTC ATTCTGAGAA 

ATGCCATCCG TAAGATGCTT TTCTGTGACT ^TGAGTA ^ TACCGCG CCA 

TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA * ^ AAAACTCTCA 

CATAGCAGAA CTTTAAAAGT ^TCATCATT G^AC CTCG TGCACC CAACTGATCT 

AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAA ^caogaAG GCAAAATGCC 

TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA ™ CCTTTTTCAA 
GCAAAAAAGG GAATAAGGGC -CAC^GAAA -TTGAATAC TGAATGTATT 5340 

TAGAAAAATA SESSS "cGC ACATTTCCCC GAAAAGTGCC AC 
(2) INFORMATION FOR SEQ ID NO : 7 : 



4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CTAAATTGTA AGCGTTAATA TTTTGTTAAA ATTCGCGT ^ AAATCAGCTC 

ATTTTTTAAC CAATAGGCCG AAATCGGCAA JMCCC™ qtATTAAAGA ACGTGGACTC 
GATAGGGTTG AGTGTTGTTC CAGTTTGGAA CAAGAGTCCA ^ MCCATOCC 
CAACGTCAAA GGGCGAAAAA CCGTCTATCA ^GATO ^^cc CTAAAGGGAG 
CTAATCAAGT TTTTTGGGGT CGAGGTGCCG TAAAGCAC Q ^GAAGAA 

CCCCCGATTT AGAGCTTGAC ^GGAAAGCC ^GAACGTG gtcacgctgc GCGTAACCAC 
AG C G AAAGG A GCGGGCGCTA GGGCGCTGGC ^ G J G CATTCGCCAT TCAGGCTGCG 

CACACCCGCC GCGCTTAATG CGCCGCTACA GGGCGCGTCC TTACGCCAGC TGGCGAAAGG 
CAACTGTTGG GAAGGGCGAT CGGTGCGGGC TTTTCCCAG T CACGACGTTG 

GGGATGTGCT GCAAGGCGAT TAAGTTGGGT J^GCC^ XAGGGCGAAT TGGAGCTCCA 
TAAAACGACG GCCAGTGAGC GCGCGTAATA CGACTCACTA G CCCTAAAGC 
CCGCGGTGGC GGCCGCTCTA GATTATATAA ™TAAGCT ^ TCQTCGCACG 

ACTATCGTAT CACCTATCTA ^TAAGTCAC ^AGT^ CCACACATAA AGGTTTATGT 

GAATTGCATG TTTCTTGTTG ^AAGCATATT ^CAA GGATGCATAT GTATGGTTCT 

ATAAACTTAC ATTTAGCTCA GTTTAATTAC ^TCTTATT ^ AG -p q c AAC AT 

CAATCCATAT AAGTTAGAGT AAAAAATAAG TTATATTGAG GTGAATTTGG 

ATATGGATCT ACAATACTCA TGTGCATCCA ^AAACTA ^ AGGA QTTTCA 

TAGAAATTAA ACTAACTTAC ACACTAAGCC ^CTTl fTCTTTATAA TCAACCCGCA 

ACGATCGTCC CGCGTCAATA TTATTAAAAA JCTCC^AT TAAGGTGAAA 
CTCTTATAAT CTCTTCTCTA CTACTATAAT AAGAGAGTTT ^ CCTAATCAAT 

TTATCTATAA GTGTTCTGGA TATTGGTTGT ^ CATATATATA TATATATCAT 

AGAAAACATA TGTTTTATTA AAACAAAATT JATCATA G xGCAACTTA ATACATGTGT 

ATATATATAT AAACCGTAGC AATGCACGGG C ATATAAC1 A 



120 
180 
240 
300 
360 
420 
480 
540 
600 
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720 
780 
840 
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1140 
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1260 
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GTATTAAGAT GAATAAGAGG GTA = SSSS TAGAAGGAGA 

AGGGGTTGGA AACGATTAAA CGATTAAATC TCTTCCTA^ ^ CTAAAGTGGT 

TTTAATATAT CCCAATCCCC ™ATCATC CAGGTG TGTATCCTTA 

gaggaacacg aaagaaccat ^attggcat ^aaagc attgggtaaq ^caatcaa 

ACAACTCACA GAACATCAAC CAAAATTGCA CG ^ CCGAGATCAT ACTCATCTGA 

ACAAATCCTC TCTGTGTGCA AAGAAACACG <™£^ TATTGCATTA C AAAG AT C GT 
TATACATGCT TACAGCTCAC ^ACATTAC TGACGTGTAA AGTAAATTTA 

TTCATGAAAA ATAAAATAGG CCGGACAGGA TTTTACGTAG ATCAACAACC 

CAACAAAAAA AAAGC CAT AT GTCAAGCTAA ^TAAT TGATTCCAGA TGAACCATCG 

tgtagaaggc aacaaaactg agccacgcag aag^cag caagaaacca TGAAGCTGCC 

ACGTGCTACG TAAAGAGAGT ^GAGTCAT GA ^ GTGT TAATTAATCA AAGCTATAAA 

tacagccgta tcggtggcat aagaacacaa GGGTCTTC ag ACCATTAGCT 

TAACGCTCGC ATGCCTGTGC ^TTCTCCAT ^CCACC GAAGTCGGTG GAGAAGAAAC 

TTATCTACTC CAGAGCGCAG ^AACCCGA GAAGACAGAG TGGCCGGAGT 

CGAAGGGTGT GAAGACAGGT GCGGGTGACA JGCATAA GAAGGA CAAG CCAGAGGCGC 

TGGTGGGGAA ATCGGTGGAG ^GCCAAGA ^ATCC c GACAAGGTC A 

AAAT C AT AGT TCTACCGGTT ^ACJAAGG JGGGTAAGC cagggtcqgc TAGCCTCGAG 
AGCTTTTTGT GGATAAAAAG ^AACATCG ^AGG AG TATAGCCG CTGGCTAGCT 

ATCCCCGGCG GTGTCCCCCA CTGAAGAAAC JATGTG GTGTCACGCA TCACCATGCA 

AGCTAGTTGA GTCATTTAGC GGCGATGATT ^™ ATTGAAATG A AAAGAAAAAA 

TGGGTGGCAG TCTCAGTGTG ^AATGACC ^G TATACAATAA TTGATATATG 

GTATTGTTCC AAATTAAACG ™AACCTT ™™ rp AGA CGAAAA AAAATCTAAG 

TTTTCTGTAT ATGTCTAATT TGTTATCATC CATTTAGA TTGGGATAAT GTCGATGAGA 
AACTAAAACA AATGCTAATT TGAAATGAAG GGAGTATATA ^ tacqtgtaTT 

TCCCTCGTAA TAT C AC C G AC ATCACACGTG ^CAGTTAA ^ AGT C AACGG A 

CACATTTGTT GCGCGTAGGC GT AC C C AAC A. ^TTTGATCG GTGAGGGTTA 
AGCGAGTCGA CCTCGAGGGG GGGCCCGGTA CC^GCTTTT GT ^ TTATCCGCTC 

ATTGCGCGCT TGGCGTAATC ATGGTCATAG ^^CC aagcCTGGGG TGCCTAATGA 

acaattccac acaacatacg agccggaagc ataaagtg ctttccagtc GGGAAAC ctg 

GTGAGCTAAC TCACATTAAT ^GTTGCGC JCACTGC gagqcggttt GCGTATTG GG 

TCGTGCCAGC TGCATTAATG AATCGGCCAA C f G TCGTTCGGCT GCGGCGAGCG 

CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCT AA TCAGGGGA TAACGCAGGA 

GTATCAGCTC ACTCAAAGGC ^TAATACGG TTATCCA^ qtaaaaaggc CGCGTTG CTG 
AAGAACATGT GAGCAAAAGG CCAGCAAAAG OCfflSQA^ ^tcGACG CTCAAGTCAG 
GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GMCATCACA TTCCC CCTGG AAGCTCCCTC 
AGGTGGCGAA ACCCGACAGG ACTATAAAGA TKC^C TGTCCGCCTT TCTCCCTTCG 
GTGCGCTCTC CTGTTCCGAC ^TGCCGCTT ^CGG TCAGTTCGGT GTAGGTCGTT 

GGAAGCGTGG CGCTTTCTCA ™GCTCACGC ^TftGG CCGACCQCTG C GCCTTATCC 

CGCTCCAAGC TGGGCTGTGT GCACGAACCC ^ TATCGCCACT GGCAGCAGCC 

GGT AACT AT C GTCTTGAGTC ^CCCGGTA AGACA CTACAGAGTT CTTGAAGTGG 

ACTGGTAACA GGATTAGCAG AG ^ A ^GTAT °™ TCTGCGCTCT GCTGAAGCCA 

TGGCCTAACT ACGGCTACAC ^GAAGGACA ^ATTTG AA c AAAC C AC CGCTGGTAGC 

GTTACCTTCG GAAAAAGAGT TGGTAGCTCT ™ff XCAAGAAGAT 
GGTGGTTTTT TTGTTTGCAA ^CAGCAGATT ^GCG ^c^cG TTAAGGGATT 

C CTTTG ATCT TTTCTACGGG GTCTGACGCT ^G AAAA TGAAGT 
TTGGTCATGA G ATT AT C AAA AAGGATCTTC JCCT AC AGTT AC C A ATGCTTAATC 

TTT AAAT C AA TCTAAAGTAT ^ATGAGTAA ACTTGGTC^ ctgacTCCCC 
AGTGAGGCAC CTATCTCAGC CATCTGTCTA ^TC GCCCCAGTG C TGCAATGATA 

GTCGTGTAGA T AAC T AC GAT ^GGGAGGGC ™CAT taaaccagcc AGC CGGAAGG 

CCGCGAGACC CACGCTCACC ^CTCCAGAT ™CA TCCAGTCTAT TAATTGTTGC 

GCCGAGCGCA G AAGTGGTC C TGCAACTTTA ^GC GCAACGTTGT TGCCATTGCT 

CGGGAAGCTA GAGTAAGTAG ^CGCCAGTT ^AGT CATTCAGCTC CGGTTCCCAA 

ACAGGCATCG TGGTGTCACG CTCGTCGTTT ^TATG CGQTTAG CTCCTTCGGT 

CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA ^ T TATGGCAGCA 4680 

CTGCATAATT CTCTTACTGT SSSSS G^ATGCT = AC «^ -J 
=1 SSSS ACATAGCAGA SSSS TG CT C AT CAT TGGAAAACGT 4860 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
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3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
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3780 
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3900 
3960 
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4080 
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§l=liiii=fill 

GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC 
CGAAAAGTGC CAC 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

AGTATAAGTA AAC AC AC CAT CACACCCTTG AGGCCCTTGC TGGTGGCCAT GGTG 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCTCACATCC CTTAGTGCCT AAGTTCGACG TCGGGCCCTC TAGTCGACGG ATCCA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO -.10: 
AGCGGAAAAT GCCCGAAAGG CTTCCCCAAA TTGGC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 



4920 
4980 
5040 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGCGCAGGCG TCTGCAAGTG TAAGCTGACT AGTAGCGGAA AATGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

T AC AAC CTTT GCAAAGTCAA AGGCGCCAAG AAGCTTTGCG CAGGCGTCTG 

(2) INFORMATION FOR SEQ ID NO:l3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GCAAGAGTTG CTGCAAGAGT ACCCTGGGAA GGAAGTGCTA CAACCTTTGC 
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